EUROPEAN ORGANISATION FOR NUCLEAR RESEARCH (CERN) 



CERN-PH-EP-201 2-255 
Submitted to: European Physical Journal C 



Measurement of the flavour composition of dijet events in pp 
collisions at y/s = 7 TeV with the ATLAS detector 



o 

(N 
+-> 

O 

O : The ATLAS Collaboration 

Cl; Abstract 
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and bottom jets are identified in the dijet events, where the jet flavour is defined by the presence of 
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of displaced decay vertices and optimised for jet flavour identification, are used in a multidimensional 
template fit to measure the fractions of these dijet flavour states as functions of the leading jet trans- 
(\j ; verse momentum in the range 40 GeV to 500 GeVand jet rapidity \y\ <2.1. The fit results agree with the 
predictions of leading- and next-to-leading-order calculations, with the exception of the dijet fraction 
composed of bottom and light flavour jets, which is underestimated by all models at large transverse 
jet momenta. The ability to identify jets containing two ft-hadrons, originating from e.g. gluon splitting, 
is demonstrated. The difference between bottom jet production rates in leading and subleading jets is 
consistent with the next-to-leading-order predictions. 
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Abstract This paper describes a measurement of the flavour 
composition of dijet events produced in pp collisions at s/s — 
7 TeV using the ATLAS detector. The measurement uses the 
full 2010 data sample, corresponding to an integrated lumi- 
nosity of 39 pb _1 . Six possible combinations of light, charm 
and bottom jets are identified in the dijet events, where the 
jet flavour is defined by the presence of bottom, charm or 
solely light flavour hadrons in the jet. Kinematic variables, 
based on the properties of displaced decay vertices and op- 
timised for jet flavour identification, are used in a multidi- 
mensional template fit to measure the fractions of these di- 
jet flavour states as functions of the leading jet transverse 
momentum in the range 40 GeV to 500 GeV and jet rapid- 
ity |y| < 2.1. The fit results agree with the predictions of 
leading- and next-to-leading-order calculations, with the ex- 
ception of the dijet fraction composed of bottom and light 
flavour jets, which is underestimated by all models at large 
transverse jet momenta. The ability to identify jets contain- 
ing two fo-hadrons, originating from e.g. gluon splitting, is 
demonstrated. The difference between bottom jet produc- 
tion rates in leading and subleading jets is consistent with 
the next-to-leading-order predictions. 

Keywords heavy flavour jet ■ b-jet production • dijet event 



1 Introduction 

A study of the production of jets containing bottom and 
charm hadrons, which are likely to have originated from bot- 
tom or charm quarks, is of strong interest for an understand- 
ing of Quantum Chromodynamics (QCD). Charm and bot- 
tom quarks have masses significantly above the QCD scale, 
Aqcd, and hence their production can be calculated reliably 
using perturbation theory. They are thus excellent probes of 
the underlying QCD dynamics. 



Several mechanisms contribute to heavy flavour quark 
production, such as quark-antiquarkpair creation in the hard 
interaction or in the parton showering process. While the 
former is calculable in a perturbative approach, the latter 
may require large non-perturbative corrections or different 
approaches such as a heavy quark mass expansion. In inclu- 
sive heavy flavour jet cross-sections, the contributions from 
different heavy flavour quark production mechanisms are in- 
distinguishable. This complicates a comparison with theo- 
retical calculations. In contrast, a more exclusive study of 
the production of dijet events containing heavy flavour jets 
allows the contributions of different heavy flavour quark cre- 
ation processes to be separated. For example, the dominant 
QCD production mechanisms are different for pairs of bot- 
tom flavour jets and pairs consisting of one bottom and one 
light jet. In this context, a measurement of the flavour com- 
position of dijet events sheds light on the details of the dif- 
ferent QCD processes involving heavy quarks. 

The dijet system can be decomposed into six flavour 
states based on the contributing jet flavours. The jet flavour 
is defined by the flavour of the heaviest hadron in the jet. 
A light jet originates from fragmentation of a light flavour 
quark (u, d and s) or gluon and does not contain any bottom 
or charm hadrons. Three of these dijet states are the symmet- 
ric bottom+bottom (bb), charm+charm (cc) and light+light 
jet pairs. The three other combinations are the flavour- 
asymmetric bottom+light, charm+light and bottom+charm 
jet pairs. In the following discussion, these six dijet flavour 
states will be denoted BB, CC, UU, BU, CU, BC, where U 
stands for light, C for charm and B for bottom jet. 

Inclusive bottom jet and bb production in hadronic colli- 
sions have been studied by several experiments QJ[5] in the 
past, see also a review [6 | and references therein. Recently 
CMS published cross-sections for inclusive bottom jet pro- 
duction [7 1, bb decaying to muons [8| and bottom hadron 
production Q, as well as BB angular correlations [ 10 1. The 
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bb cross-section was also measured by LHCb 111 11 . ATLAS 
published a measurement of the bb cross-section in proton- 
proton collisions at -Js = 7 TeV [12|, employing explicit 
b-jet identification (^-tagging). However, the bb final state 
constitutes only a small fraction of the total heavy flavour 
quark production in dijet events, and the inclusive bottom 
cross-section contains a significant contribution from multi- 
jet states. This paper presents a simultaneous measurement 
of all six dijet flavour states, including those with charm. 
The BC, CC and CU dijet production at the LHC is stud- 
ied for the first time. This approach provides more detailed 
information about the contributing QCD processes and chal- 
lenges the theoretical description of the underlying dynam- 
ics employed in QCD Monte Carlo simulations. 

The analysis procedure exploits reconstructed secondary 
vertices inside jets. Since kinematic properties of secondary 
vertices depend on the jet flavour, a measurement of the indi- 
vidual contributions of each flavour can be made by employ- 
ing a fit using templates of kinematic variables. No explicit 
^-tagging is used, i.e. no flavours are assigned to individual 
jets. The excellent separation of charm and bottom flavoured 
jets in the ATLAS detector is demonstrated in the analysis. 

The analysis uses the data sample collected by ATLAS 
at y/s = 7 TeV in 2010, corresponding to an integrated lu- 
minosity of 39 pb _1 . The prescale settings of the different 
single-jet triggers used in the analysis varied with luminos- 
ity such that the actual recorded luminosity is dependent on 
the transverse momentum pj of the leading jet. 

This paper is organised as follows. The ATLAS detec- 
tor is briefly described in Sect. [2] Section [3] describes the 
event and jet selection procedure for data and Monte Carlo 
simulation. Section [4] summarises the Monte Carlo simula- 
tion. Section [5] discusses the theoretical predictions for the 
flavour composition of dijet events. The reconstruction of 
secondary vertices in jets as well as the kinematic templates 
for the flavour analysis are presented in Sect. [6] A detailed 
account of the analysis method is given in Sect. [7] In Sect. [8] 
the results of the analysis are presented and systematic un- 
certainties are discussed. 



2 The ATLAS detector 

The ATLAS detector [13] was designed to allow the study of 
a wide range of physics processes at LHC energies. It con- 
sists of an inner tracking detector, surrounded by an elec- 
tromagnetic calorimeter, hadronic calorimeters and a muon 
spectrometer. For the measurements presented in this paper, 
the tracking devices, the calorimeters and the trigger system 
are of particular importance. 

The innermost detector, the tracker, is divided into three 
parts: the silicon pixel detector, the closest layer lying 
5.05cm from the beam axis, the silicon microstrip detector 



and the transition radiation tracker, with the outermost layer 
situated at 1 .07 m from the beam axis. These offer full cover- 
age in the azimuthal angle <p and a coverage in pseudorapid- 
ity of |tj I < 2.50. The tracker is surrounded by a solenoidal 
magnet of 2T, which bends the trajectories of charged parti- 
cles so that their transverse momenta can be measured. The 
liquid argon and lead electromagnetic calorimeter covers a 
pseudorapidity range of |t}| < 3.2. It is surrounded by the 
hadronic calorimeters, made of scintillator tiles and iron in 
the central region {\t]\ < 1.7) and of copper/tungsten and 
liquid argon in the endcaps (1.5 < |tj| < 3.2). A forward 
calorimeter extends the coverage to \rj\ < 4.9. The muon 
spectrometer comprises three layers of muon chambers for 
track measurements and triggering. It uses a toroidal mag- 
netic field with a bending power of 1-7.5 Tm and provides 
precise tracking information in a range of |tj | < 2.7. 

The ATLAS trigger system ifPTl uses three consecutive 
levels: level 1 (LI), level 2 (L2) and event filter (EF). The 
LI triggers are hardware-based and use coarse detector in- 
formation to identify regions of interest, whereas the L2 trig- 
gers are based on fast online data reconstruction algorithms. 
Finally, the EF triggers use offline data reconstruction algo- 
rithms. This study uses single-jet triggers. 

3 Event and jet selection 

Selected events are required to have at least one recon- 
structed primary vertex candidate. A candidate vertex must 
have at least 10 tracks with transverse momentum pj > 
150 MeV associated to it, to ensure the quality of the ver- 
tex fit. If several vertex candidates are reconstructed, the one 
with the largest sum of the squared transverse momenta of 
associated tracks is considered to be the main interaction 
vertex and used as the primary vertex in the following. 

Jets are reconstructed using the anti-£ r algorithm with 
a jet radius parameter R = 0.4 fl4l . Topological clusters of 
energy deposits in the calorimeters are used as input for 
the clustering algorithm. Tracks within a cone of AR = 
\J (Aq>) 2 + (Arj) 2 — 0.4 around the jet axis are assigned 
to the jet. Only jets with a transverse momentum of pj > 
30 GeV and a rapidity of \y\ < 2.1 are considered. Jets in 
this rapidity range are fully contained in the tracker accep- 
tance region, such that track and vertex reconstruction in- 
side jets are not affected by the boundaries of the tracker ac- 
ceptance. Jets are furthermore required to pass a quality se- 
lection lfT31l that removes jets mimicked by noisy calorime- 

1 ATLAS uses a right-handed coordinate system with its origin at the 
nominal interaction point (IP) in the centre of the detector and the z- 
axis along the beam pipe. The *-axis points from the IP to the centre 
of the LHC ring, and the j-axis points upward. Cylindrical coordinates 
(r, 0) are used in the transverse plane, tf> being the azimuthal angle 
around the beam pipe. The pseudorapidity is defined in terms of the 
polar angle 6 as rj = — lntan(0/2). 
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Leading jet pj [GeV] 


40-60 


60-80 


80-120 


120-160 


160-250 


250-500 


Subleading jet pj [GeV] 


30-60 


40-80 


50-120 


75-160 


100-250 


140-500 


Number of events 


304103 


251406 


887185 


660168 


242979 


146117 


/Ldf[nb-'] 


70 


247 


1880 


8640 


8640 


38700 



Table 1 Kinematic boundaries, together with the numbers of selected dijet events and the corresponding integrated luminosities for each leading 
jet pj bin. 



ter cells or those that stem from non-collision backgrounds. 
Finally, the two jets with highest pj in the analysis accep- 
tance are required to have an angular separation in azimuth 
of Aq> > 2.1rad, i.e. to be consistent with a back-to-back 
topology. This cut removes events in which one of the lead- 
ing jets is produced by final-state hard gluon emission or jet 
splitting in the reconstruction. 

The full data sample is split into six bins in the trans- 
verse momentum pj of the leading jet. The bin boundaries 
correspond to the 99% efficiency thresholds of the vari- 
ous single-jet triggers 1161 . For events passing the trigger 
requirement, the leading and subleading jets have to ful- 
fil pairwise-specific pj conditions that are summarised in 
Table [T] The numbers of events selected in each leading jet 
pj bin are shown in TableQ] together with the corresponding 
integrated luminosities. 



4 Monte Carlo simulation 

Dijet events are simulated using Pythia 6.423 ifTTIl for 
the baseline template construction, parameter estimation and 
Monte Carlo (MC) comparisons. This leading-order (LO) 
generator is based on parton matrix-element calculations for 
2^2 processes and a string hadronisation model. Modified 
leading-order MRST LO* [ 18 1 parton distribution functions 
are used in the simulation. Samples of dijet events were gen- 
erated using a specific set of generator parameters, known as 
the ATLAS Minimum Bias Tune 1 (AMBT1) fl9l . 

For the study of systematic effects and for the in- 
terpretation of the final results, other Monte Carlo sam- 
ples are utilised. The main cross-check study is performed 
using the Herwig++ 2.4.2 ||2p| generator. The other LO 
samples used are Pythia with the next-to-leading-order 
(NLO) CTEQ 6.6 1 2 1 1 parton distribution functions and Her- 
wig 6.5 E21 used with Jimmy 4 |23]|24] for the simula- 
tion of multiple parton interactions, using a specific ATLAS 
Underlying Event Tune (AUET1) [25 1. The possible influ- 
ence of multiple proton-proton interactions within the same 
bunch crossing is studied by adding minimum bias events, 
customised to the beam conditions of the 2010 LHC run at 
7 TeV, to each PYTHIA event. 

The Pythia 6.423+EvtGen ||26) event generator, us- 
ing charm and bottom decay matrix elements with all se- 



quential decay correlations and measured branching ra- 
tios, where available, is utilised for the simulation of the 
physics of bottom and charm hadron decays. It will be called 
Pythia+EvtGen in the rest of the paper. 

The NLO generator POWHEG 12714301 is used to inter- 
pret the analysis results. In POWHEG, the parton distribution 
function set used for the event generation is MSTW 2008 
NLO [31 1 and the parton shower generator is PYTHIA. 

In order to compare Monte Carlo predictions with data, 
"truth-particle" jets are used. They are defined by the anti-fc, 
R = 0.4 algorithm using only stable particles with a lifetime 
longer than lOps in the Monte Carlo event record. Muons 
and neutrinos do not contribute significantly to the jet energy 
in data. Therefore, they are also excluded from the truth- 
particle jets, to avoid having to correct for the missing jet 
energy in data. 

The flavour of jets is assigned in the Monte Carlo sim- 
ulation by labelling a jet as a b-jet if a bottom hadron with 
pj > 5 GeV is found within a cone AR = 0.3 around the jet 
axis. If no bottom hadron is present but a charm hadron is 
found using the same requirements, then the jet is labelled 
as a c-jet. All other jets are labelled as light jets. If two bot- 
tom hadrons with pj > 5 GeV are found within a cone of 
size AR = 0.3 the jet is labelled as a b-jet with two bottom 
hadrons, and similarly for c-jets with two charm hadrons. 

The particle four-momenta are passed through the full 
simulation 021 of the ATLAS detector, which is based on 
GEANT4 ll33l . The simulated events are reconstructed and 
selected using the same analysis chain as for data. After the 
dijet event selection, the Monte Carlo events are re weighted 
in each analysis pj bin to match the observed leading and 
subleading jet pj spectra. Any remaining discrepancies in 
the rapidity distributions between data and simulation are 
small and are included as sources of systematic uncertainty, 
as detailed in Sect. 18.31 

5 Theoretical predictions 

5.1 Heavy flavour production 

Following the discussion in 11341 . heavy flavour quark pro- 
duction in hadronic collisions may be subdivided into three 
classes depending on the number of heavy quarks participat- 
ing in the hard scattering. Hard scattering is defined as the 
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Fig. 1 The contributions of the different production processes to inclusive 6-jet production in 7 TeV pp collisions are shown as a function of 6-jet 
pi, as given by PYTHIA 6.423 and obtained for truth-particle jets. The plot on the left (a) shows the contribution of quark pair creation, heavy 
flavour quark excitation and gluon splitting; the plot on the right (b) shows the different processes contributing to gluon splitting, namely initial- 
and final-state gluon splitting, the latter leading to jets with one or two fo-hadrons. Truth-particle jets are reconstructed with the anti-&, R = 0.4 
algorithm in the |v| < 2. 1 rapidity region. 



2—^2 subprocess with the largest virtuality (or shortest dis- 
tance) in the hadron-hadron interaction. In the following, Q 
stands for a heavy flavour quark, q for a light flavour quark 
and g for a gluon: 

- Quark pair creation: two heavy quarks are produced in 
the hard subprocess. At leading order this is described 
by gg -> QQ and qq -> QQ. 

- Heavy flavour quark excitation: a single heavy flavour 
quark from the sea of one hadron scatters against a par- 
ton from another hadron, denoted gQ — > gQ and qQ —> 
qQ, respectively. Alternatively, the heavy flavour quark 
excitation process can be depicted as an initial-state gluon 
splitting into a heavy quark pair, where one of the heavy 
quarks subsequently enters the hard subprocess. 

- Gluon splitting: in this case heavy quarks do not parti- 
cipate in the hard subprocess at all, but are produced in 
g —> QQ branchings in the parton shower. 

The relative contributions of the different heavy flavour 
quark production mechanisms to inclusive b-jet production 
are shown in Fig. |l(a)| for simulated proton-proton collisions 
at 7 TeV. The fractions are calculated for anti-A:, jets in a ra- 
pidity range of |y| < 2.1 with the PYTHIA 6.423 H3 gen- 
erator. Figure |l(b)| shows the decomposition of the gluon 
splitting process into initial- and final-state gluon splitting, 
the latter leading to jets with one or two fo-hadrons. 

The above classification is not strict but can be used as a 
basis for gaining a qualitative understanding of the features 
of heavy flavour quark production. Pair creation of heavy 
flavour quarks gives an insight into perturbative QCD with 



massive quarks. The back-to-back requirement used in the 
analysis reduces the contribution of NLO QCD effects to the 
jet-pair cross-sections with two heavy flavour jets, BB and 
CC. The heavy flavour quark excitation process, on the other 
hand, is sensitive to the heavy flavour components of the par- 
ton distribution functions of the proton. It produces mainly 
flavour asymmetric BU and CU jet pairs. The gluon splitting 
mechanism is sensitive to non-perturbative QCD dynamics 
and also contributes significantly to the mixed flavour jet 
pair states, i.e. BU and CU. However, this contribution is 
different from heavy flavour quark excitation because it cre- 
ates a heavy quark-antiquark pair. The jet reconstruction al- 
gorithm either includes both heavy quarks in a single jet or 
misses one of them, thus reducing the reconstructed jet en- 
ergy and its fraction taken by the remaining quark. The two 
possibilities result in different kinematic properties of the 
reconstructed secondary vertices in these jets, which can be 
exploited for the separation of gluon splitting from the heavy 
flavour quark excitation contribution. 

To compare the predictions of theoretical models with 
data, the truth-particle jets defined in Section [4] are used in 
the analysis. The truth-particle dijet system is defined as the 
two truth-particle jets with the highest pj in the \y\ < 2.1 
rapidity range, required to be consistent with a back-to-back 
topology, Aq> > 2.1rad, with both the leading and sublead- 
ing jets having pj > 20 GeV. 

The leading-order predictions for flavour jet production 
in truth-particle dijet events are illustrated in Fig. [2] where 
the ratio of different heavy+heavy and heavy+light dijet 
cross-sections to the total dijet cross-section is shown for 
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Fig. 2 PYTHIA 6.423 predictions for different bottom and charm dijet 
fractions as a function of leading jet pj, obtained for truth-particle jet 
pairs, where the jets are back-to-back and have pj > 20 GeV in the 
|y| < 2. 1 rapidity region. 



\y\ < 2.1 as a function of leading jet pj, for 7 TeV pp col- 
lisions as predicted by Pythia 6.423. Heavy flavour jets 
in the dijet system are mainly produced in the BU and CU 
combinations. Pythia 6.423 predicts a slow decrease of the 
BB and CC fractions and an increase of the BU and CU jet 
fractions as a function of the leading jet pj. The mixed BC 
fraction increases with jet pj and becomes equal to the BB 
fraction above ~ 350 GeV. 



5.2 Differences in heavy flavour rates in leading and 
subleading jets 

The kinematic properties of the partons produced in hadro- 
nic interactions are mostly flavour independent, if mass ef- 
fects are neglected. The two back-to-back partons with the 
highest pj in the event should therefore not show any sig- 
nificant flavour-dependent difference in their kinematic fea- 
tures. However, the partons can be studied only through the 
corresponding jet properties after hadronisation. Heavy fla- 
vour quark presence in a jet can influence the jet properties 
through the following mechanisms: 

- Semileptonic decays of heavy flavour hadrons decrease 
the jet energy, because neutrinos are not detected and the 
muon energy is not measured in the calorimeter. This 
energy loss is absent for light jets and is very different 
for bottom and charm jets. 

- If several heavy flavour quarks appear in the jet fragmen- 
tation process (e.g. via gluon splitting) one of them can 
be left outside the jet volume by the jet reconstruction 
algorithm, which leads to a reduction in the jet energy. 

As a result, the average jet energy for heavy flavours be- 
comes smaller than the jet energy for light flavours, such 



that heavy flavour jets are predominantly produced as sub- 
leading jets in the mixed-flavour dijet pairs. This effect can 
be described using a flavour asymmetry defined as 



(1) 



L SL 

where N b ' denote the number of leading or subleading bot- 
tom or charm jets. The predictions for Aj, c given by different 
Monte Carlo generators are shown in Fig.[3]for the truth par- 
ticle jets defined in Sect. [4] 
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Fig. 3 The asymmetries in the amount of (a) bottom and (b) charm 
truth particle jets as taken from POWHEG+PYTHIA 6.423 (black 
points), PYTHIA 6.423 (squares), Herwig++ 2.4.2 (triangles) and 
PYTHIA+EVTGEN (open squares) in leading and subleading jets, for 
each leading jet pj bin used in the analysis. 



Powheg, which includes higher-order QCD effects, pre- 
dicts a significant flavour asymmetry which increases strong- 
ly with jet pj. The flavour asymmetry predictions of the 
LO Pythia generator are smaller than those of the NLO 
Powheg generator. The latter uses Pythia 6.423 for the 
fragmentation and thus shares the same description of the 
decays of heavy flavour hadrons. Since the influence of the 
different parton distribution functions was also found to be 
negligible, the differences in Af, c between these generators 
(Fig. [3]) should be attributed primarily to NLO QCD effects. 
The LO Herwig++ generator employs another fragmenta- 
tion model and predicts asymmetries similar to the POWHEG 
ones, although with a somewhat different pj dependence. 

For the measurement of the dijet flavour fractions, this 
flavour asymmetry needs to be correctly described in the 
data analysis. The fact that the Monte Carlo generators pre- 
dict significantly different asymmetries indicates that A/, c 
should be determined directly from the data. 
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6 Secondary vertex reconstruction and analysis 
templates 

Secondary vertices are displaced from the primary vertex 
because they originate from the decays of long-lived par- 
ticles. Kinematic properties of these vertices, e.g. the in- 
variant mass or total energy of the outgoing particles, de- 
pend on the corresponding properties of the original heavy 
flavour hadrons and are therefore different for bottom and 
charmjets. Reconstructed secondary vertices in light jets are 
mainly due to Kg and A [35] decays, interactions in the de- 
tector material, or fake vertices. The fake reconstructed ver- 
tices are composed of tracks which occasionally get close 
together due to a high density of tracks in the jet core and 
track reconstruction errors. Their properties are very differ- 
ent from those of heavy flavour decays. The current analysis 
exploits these differences by combining the kinematic fea- 
tures of the reconstructed secondary vertices in an optimal 
way into templates for bottom, charm and light jets. 

6. 1 Secondary vertex reconstruction in jets 

The vertex reconstruction algorithm aims at a high recon- 
struction efficiency and therefore determines vertices in an 
inclusive way, i.e. a single secondary vertex is fitted for each 
jet. In the case of a bottom hadron decay, the subsequent 
charm hadron decay vertex is usually close to the bottom one 
and is therefore not reconstructed separately. A detailed dis- 
cussion of the algorithm and its performance can be found in 
the ^-tagging chapter of Ref. [ 16 1. The reconstruction starts 
by combining pairs of good quality tracks inside jets to make 
vertices, where the latter are required to be displaced signif- 
icantly from the primary interaction vertex. The two-track 
vertices coming from and A decays and interactions in 
the detector material are removed from further considera- 
tion. For the light jets, the remaining candidates after this 
cleaning are mainly fake vertices. All remaining two-track 
vertices are merged into a single vertex. This vertex is re- 
fitted iteratively by removing tracks until a good vertex fit 
quality is obtained. The corresponding decay length is de- 
fined as a signed quantity, where the sign is fixed by the 
projection of the decay length vector — the vector pointing 
from the primary event vertex to the secondary vertex — onto 
the jet axis. The vertex is required to have a positive de- 
cay length and a total invariant mass, calculated using the 
momenta of associated particles and assigning them pion 
masses fl35l . greater than 0.4 GeV. 

6.2 Secondary vertex reconstruction efficiencies 

The secondary vertex reconstruction efficiency is dependent 
on the jet pj due to several effects such as the pj depen- 



dence of the track reconstruction accuracy and the increase 
of the flight distance of heavy flavour hadrons with grow- 
ing jet pj. The probability of reconstructing a fake vertex 
in a light jet is also affected by the increase of the number 
of tracks in a jet with jet pj. Due to the p-p-dependent ver- 
tex efficiency and different pj distributions for leading and 
subleading jets in dijet pairs, the number of reconstructed 
secondary vertices in these jets are different. 

The secondary vertex reconstruction efficiencies pre- 
dicted by the ATLAS detector simulation based on dijet 
events from Pythia 6.423 are shown in Fig. [4] There is 
no difference between secondary vertex reconstruction effi- 
ciencies in leading and subleading jets for charm and bot- 
tom jets. However, the fake vertex reconstruction probabil- 
ity in light jets is noticeably higher for subleading jets. This 
requires the introduction of two separate secondary vertex 
probabilities for leading and subleading light jets. 

6.3 Template construction and features 

The specific choice of the kinematic variables for the dijet 
flavour measurement is driven by the requirement to have 
maximal sensitivity to the flavour content. Furthermore, if 
several variables are to be used, the correlations between 
them should be kept small. Another important requirement 
is a minimal dependence on the jet pj and rapidity, in or- 
der to minimise systematic effects due to a possible pj or 
rapidity mismatch between data and Monte Carlo simula- 
tion. Also, px-invariant variables allow a robust analysis to 
be made over a wide range of pj. 

For this study the following two variables are chosen: 

"l vertex 0.4 GeV vertex 

m B Y.Ei 

jet 

V™B- I \P%i\ 
n vertex 

a = , (3) 

m vertex ■ ^/PrJ7t 

where each sum indicates whether the summation is per- 
formed over particles associated with the secondary vertex, 
or over all charged particles in the jet. Particle transverse 
momentum and energy are denoted as pj and E, respec- 
tively. In essence, fl is the product of the invariant mass 
of the particles associated with the vertex (m V ertex) and the 
energy fraction of these particles with respect to all charged 
particles in the jet. The 0.4 GeV constant in Eq.© is the cut 
value used for the secondary vertex selection in this analysis. 
The parameter B corresponds approximately to the relativis- 
tic y factor of the system composed of the particles asso- 
ciated with the vertex, normalised to the square root of the 
jet transverse momentum. The mj = 5.2794 GeV constant 
is the average B-meson mass [35 1 and is used for normalisa- 
tion. 



7 



~ 0.2r 

§ 0.18: 

.ri 0.16 : 
o 

°- 0.14 : 

x : 

t 0.12- 

o) : 

| 0.1: 
I 0.08 ! 
0.06: 
0.04: 
0.02: 



ATLAS 

Simulation, Pythia 6.423 



***** 



Leading jet 
Subleading jet 



100 200 



(a) 



300 400 500 
Jetp [GeV] 



jS 0.55 
o 

— 0.5 

^ 0.45 
o 

a> 0.4 
| 0.35 
I 0.3 
0.25 
0.2 
0.15 
O.t 



ATLAS 

Simulation, Pythia 6.423 



Leading jet 
Subleading jet 



100 200 



(b) 



300 400 500 
Jetp [GeV] 



1 

0.9 
0.8 
0.7 
0.6 
0.5 
0.4 



ATLAS 

Simulation, Pythia 6.423 



Leading jet 
Subleading jet 



100 200 



(c) 



300 400 500 
Jetp [GeV] 



Fig. 4 The reconstruction probabilities for fake vertices in (a) light jets, as well as the reconstruction efficiencies for secondary vertices in (b) 
charm and (c) bottom jets, are displayed as a function of the jet pj as predicted by PYTHIA 6.423. 
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Fig. 5 Two-dimensional distributions of JI T and B T (flavour templates) obtained with PYTHIA 6.423 for (a) light, (b) charm and (c) bottom jets 
with px in the bin [60, 80] GeV. 



To facilitate the fit procedure, the variables are trans- 
formed into the interval [0, 1 ] : 



n 1 



B 



n 



n+o.04 

BB 
BB+10. 



(4) 



(5) 



The tuning constants 0.04 in Eq.© and 10 in Eq.® have 
been chosen to maximise the difference in the mean values 
between the light and heavy flavour distributions. 

Joint distributions of these observables are shown in 
Fig. |5] for light, charm and bottom jets in the [60, 80] GeV 
bin, as predicted by the full detector simulation of 
Pythia 6.423 events. These two-dimensional distributions 
are used as flavour templates U(n J ,B T ), C(i7 T ,B T ) and 
B(n J ,B J ) in the analysis as detailed in Section|7] Features 
of the observables are also illustrated in Figs.|6]and|7] Both 
f! T and B J are independent of jet rapidity, as can be seen 
in Fig. [6] This figure shows the light jet templates, which 



are most sensitive to reconstruction and detector effects. The 
f! T variable is very similar in shape in the [40,60] GeV and 
[250, 500] GeV bins and is only weakly /^-dependent. Fig- 
ure|7]demonstrates that f! T is only weakly dependent on the 
different heavy flavour production mechanisms described in 
Sect. [5] In contrast, the B T variable is sensitive to the gluon 
splitting contribution, in particular to the case where this 
mechanism produces two quarks of the same flavour in a 
jet. In addition B J has a distinct pj dependence. However, 
the B T variable provides good sensitivity to the charm con- 
tribution. 

The fraction of jets with two heavy quarks produced in 
gluon splitting may be incorrectly predicted by the PYTHIA 
simulation, especially in the high pj region where this con- 
tribution becomes large (see Fig. [TJ. This phenomenon was 
discussed in more detail in ll36l . Therefore a separate con- 
tribution of doubly-flavoured jets is included in the analy- 
sis, to account for the corresponding dependence of the B J 
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Fig. 6 The J7 T and B T distributions of light jets in the [(a)] [40, 60] GeV and|(b)] [250, 500] GeV leading jet (LJ) p T analysis bins obtained with 
fully simulated PYTHIA 6.423 dijet events. The distributions are shown in different jet rapidity ranges. 
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Fig. 7 The J7 T and S T distributions in the [40, 60] GeV leading jet (LJ) Pt range for |(a)| charm jets and |(c)| bottom jets as well as in the 
[250,500] GeV range for |(b)| charm jets and |(d)| bottom jets obtained with fully simulated PYTHIA 6.423 dijet events. The distributions are shown 
separately for jets stemming from quark pair creation, heavy flavour quark excitation, gluon splitting (GS) with one or two heavy flavour quarks 
inside the jet. All distributions are normalised separately to unit area. 



variable. The two-dimensional template for bottom jets is 
replaced by the two-component template 

b (n T , b t ) -> ( i - hi) ■ b (n T , b t ) + b 2 ■ b 2 (n T , b t ) , (6) 

where B 2 (n J ,B J ) is a template for jets with two fo-hadrons 
and b 2 is a parameter governing the relative contributions. 
The charm jet template is modified similarly with substitu- 
tions b 2 -> c 2 and B 2 (17 T , B T ) -> C 2 (17 T , B T ) . Using Eq.©, 
the heavy flavour template shapes can be obtained directly 
from the data by optimising the b 2 and c 2 parameters to 
achieve the best possible data description. As is demonstrated 
in Sect. [8] the adjustment of the contribution of jets with two 



b-hadrons to the bottom template significantly improves the 
overall quality of the description of the dijet data. 



6.4 Template tuning on data using track impact parameters 

The secondary vertex reconstruction algorithm uses track 
impact parameters divided by their measurement uncertain- 
ties for the vertex search, thus its results depend crucially on 
the track impact parameter resolution. A good description 
of the track impact parameter accuracy and the correspond- 
ing covariance matrix is therefore mandatory in the detector 
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simulation, in order for the secondary vertex templates to be 
constructed correctly. 

To improve the agreement between data and Monte 
Carlo simulation, the analysis templates are tuned on data. 
Firstly, an additional track impact parameter smearing is 
applied to the Pythia events. To estimate the necessary 
amount of smearing, the data and Monte Carlo track im- 
pact parameter distributions are compared in bins of track pj 
and pseudorapidity. However, the smearing procedure does 
not correct the track covariance matrices. A second step is 
therefore taken. Two sets of templates are produced, using 
both the smeared and non-smeared Pythia 6.423 samples. 
A normalised mixture is then compared with the data, using 
secondary vertices with negative decay length to obtain the 
optimal mixing fraction. These vertices depend only weakly 
on the exact flavour content of jets and are not used in the 
dijet analysis. The mixing fraction is chosen to be flavour 
independent. The optimal description of the data for the full 
pj range is obtained with a fraction F$ mear = 0.654 ±0.023 
for the smeared template in the mixture. This template tun- 
ing procedure gives a significant improvement in the data fit 
quality in the signal region. 



7 Analysis method 

7.1 Dijet system description 

The secondary vertex reconstruction procedure can find ver- 
tices with probabilities \>u, \'c ar, d v b for light, charm and 
bottom jets, respectively. For simplicity, the p-j-dependence 
of these probabilities and the differences between leading 
and subleading jets (see Sect. 16.2b are neglected for the mo- 
ment. In the leading and subleading jet of a dijet event, zero, 
one or two secondary vertices can be reconstructed overall. 
The numbers of 2-, 1-, or 0-vertex dijet events can be calcu- 
lated as: 

= vu v u.fuu + v c v c.fcc + v bvb/bb (7) 
+ v uvc.fcu + vuvb/bu + vcvb/bc 

^ = 2(1- vu) ■ v v >fuu + 2(1 - v c ) • vc-fcc (8) 

+2(1 -V B )-V B -f BB 

+ ((1 - vu) ■ vc + vu ■ (1 - vc)) ■ feu 
+ ((1 - v v ) ■ v B + v v ■ (1 - vb)) ■ Jbu 
+ ((1 - v c ) • v B + v c • (1 - vb)) ■ f BC 

Nov = N-Niv-N 2 v- (9) 



Here N is the total number of dijet events and fxx is 
the fraction of the respective dijet flavour component chosen 
such that 

fuU+fcC + fBB + fcU+fBU+fBC = l- (10) 

The joint distribution of the f! T and B T variables for 
dijet events with one reconstructed secondary vertex can be 
obtained using Eq.©: 

2>(II J \B J )=2{\-v u )v u f uu U{n 1 \B T ) (11) 
+2(l-v c )v c /ccC(iT T ,B T ) 
+2{l-v B )v B f BB B(n T ,B T ) 

+ {(l-v u )v c C(n T ,B T ) + v u (l-v c )U(n T ,B T )}fcu 
+ {(1 - v u )v B B{n J ,B J ) + v u (l - v B )U(n J ,B J )}f BU 
+ {(1 -v c )v B B(n T ,B T ) + v c (l - v B )C(n T : B T )} f BC . 

Here £$(II J ,B J ) is the observed data distribution and 
U{n T ,B T ), C(n T ,B T ) and B(n T ,B T ) are templates de- 
rived from Monte Carlo simulation with 

J u(n T ,B T )dn T dB T = J c(n T ,B T )dn T dB T (12) 
= / B(n T ,B T )dn T dB T = i. 

The case of two reconstructed vertices requires more 
careful consideration. Assuming that the two jets are inde- 
pendent, the joint distribution of f! T and B J can be written 
considering Eq.© in the following way: 

$>(nT,Bj,nJ,Bj) = (13) 

vuvufuu ■ U(nJ,Bj)U(nJ,B]) 

+v c v c fcc ■ c(nj , Bj)c(nJ , Bj ) 

+v B v B f BB -B(nJ 7 Bj)B{nJ,Bj) 
+0.5 • vuvefeu 

x { u (nj ,Bj)c(nJ,Bj) + u (nj , b 2 t )c(jt 1 t ,bJ)} 

+0.5 • vuvb/bu 

x{u(nJ,Bj)B(nJ : Bj) + u(nJ,Bj)B(nJ,Bj)} 

+0.5 • v c v B f BC 

x {C(nT,Bj)B(nJ,Bj) +c{iq,Bj)B{nJ,Bj)}. 

Provided that the templates f/(fI T ,fi T ), C(17 T ,B T ) 
and B(n T ,B T ) are given, the eight variables 
vu, v c , v B , fee, /bb, /bu, feu , !bu fully describe the 
properties of secondary vertices in ideal dijet events without 
kinematic dependencies. Note that only five fractions are 
needed, since any of the six fractions depends on the others 
through Eq.dTOli. In this paper the quantity fyu is excluded. 

The description of the dijet system must be modified to 
take into account the dijet flavour asymmetry (Sect. I5.2I >. 
The BB and CC dijet states are flavour-symmetric and thus 
do not require any modifications in their treatment. The de- 
scription of the BC dijet fraction is also left symmetric be- 
cause charm and bottom asymmetries partially compensate 
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each other and the fraction itself is small (< 0.5 %). Thus 
only the treatment of the BU and CU fractions has to be 
modified. The analysis formalism is changed in the follow- 
ing way. The sample of dijet events with only one recon- 
structed secondary vertex is split into two subsamples, ac- 
cording to whether the vertex is reconstructed in the lead- 
ing or subleading jet. These two subsamples are described 
separately, assuming different contributions of the CU and 
BU dijet fractions. More specifically, the feu and fsu co- 
efficients in Eq.® and Eq. dl lb are replaced by pairs of co- 
efficients f^y, ffiy and fg,j, fgy for leading and sublead- 
ing jets, respectively. L and SL denote here whether the 
heavy flavour is in the leading or in the subleading jet. 
The jet flavour asymmetry of Eq.(fl~|i can be rewritten as 
Ab.c — f{Bc}u/f{BC}u ~ The new equations for events 
with a reconstructed secondary vertex in the leading jet can 
then be written: 



N L 



2-(l-vu)-vv fuu +2 • (1 - vc) • v c • /( 



cc 



(14) 



+2 • (1 - V S ) • V B • f BB 



-(l- Vu ). Vc .fL u + Vu .(l- Vc ).f?L 



>cu 

+(1 - v v ) ■ vb-Ibu + vtr (1 - v B ) -flu 
+ ((1 - vc) ■ v B + v c ■ (1 - v B ))-f BC 



CU 



@ L (n T ,B T ) =2-(l- Vu ) vufuu ■ U(n T ,B T ) (15) 
+2-(l-v c )v c f cc -C(n J ,B J ) 
+2-{l-v B )v B f BB -B(n J ,B T ) 
+ (1 - v u )v c -C(n T ,B T )f^ u +v u (l-v c ) ■ U(n T ,B T )f^ 
+ (1 - Vu )v B -B{n J ,B J )f^ + Vu (l - v B ) ■ u(n T ,B T )fl^ 



In order to reduce the set of parameters in the model to 
the maximum that is affordable with the 2010 data statistics, 
additional assumptions need to be made. The charm and bot- 
tom vertex reconstruction efficiencies are defined mainly by 
heavy flavour hadron lifetimes and heavy parton fragmenta- 
tion functions, which are known well from previous exper- 
iments. Therefore, Monte Carlo predictions for v B and \<c 
are more robust than the fake vertex probability in light jets 
Vu, which is governed mainly by detector and reconstruction 
accuracies. The charm asymmetry A c is smaller than the bot- 
tom one (Fig. [3} and the admixture of jets with two charm 
quarks influences the charm template shape less than in 
the bottom case (Fig. [7j. Therefore, the following simplifi- 
cations are used in the analysis: 

- The fraction of jets with two charm quarks is set to the 
baseline Pythia 6.423 prediction c 2 = 0. 

- The charm jet asymmetry is fixed to A c = max(0,A^ IC ) 
using the Pythia 6.423 prediction, see Fig. [3] 

- The /7-r-dependent parameterisations obtained with the 
full ATLAS detector simulation (Fig.|4| are used for bot- 
tom and charm vertex reconstruction efficiencies 
vc(pt) = v^ c (pt) and v B {pi) = v^ c (p T )- 

- The light jet vertex reconstruction probabilities are para- 
metrised as Vj)(pt) = ■ v^j MC \pj) and 

v (j{pr) = ' v ^*" MC ^(pt) for leading and sublead- 



ing jets, respectively. Here Vy MC ' (pj) and 



L(MC) 



Ot) 

are the /7-r-dependent secondary vertex rates in light jets, 
obtained with the full detector simulation and shown in 
Fig- El The scaling factors sv^, sv^ are allowed to vary 
in the fit. 



{(1 — v c )v B -B(n T ,B T )+v c (l - v B ) C(TI T ,B T )} f BC . Tne final mode l has a reduced set of nine free parame- 



The corresponding equations for dijet events with a re- 
constructed secondary vertex in the subleading jet can be ob- 
tained from Eq.(fT4b and Eq.dTBTl by substituting f^ v -B- f^j 

and & v O //" 



ters 



sv ui sv u > fsB : he , fee jBuJcuAbM- 



(17) 



7.2 Data fitting function 



This simplified model is used for fitting. Systematic effects 
originating from the simplifications above are included in 
the systematic uncertainties on the flavour fraction measure- 
ments. 



The complete dijet model combines all the ingredients pre- 
sented in the previous sections. The formulae above can be 
modified to take into account the dependence of the vertex 
reconstruction efficiencies on jet pj, as well as on whether 
jets are leading or subleading (Sect. loT2l . Variable fractions 
of jets with two bottom or charm quarks inside can also be 
incorporated (Sect. 16.31 ). The full model has the following 
set of parameters: 

v\j Or ) , v S u (pt ) , v c Or ) , v B Ot) , 
/bb , f B c , fee ,fBu, feu > 

A c ,C2,AbM- 



(16) 



7.3 Validation of the analysis method 

A dedicated simulation technique was developed to vali- 
date the analysis method. It uses a set of secondary vertices, 
which are reconstructed in all jets in the dijet sample gen- 
erated with Pythia after full ATLAS simulation, and are 
stored in a dedicated database in bins of jet pj, rapidity and 
flavour. 

To produce a dijet event, the pj and \y\ values for each 
jet are generated randomly according to the corresponding 
data distributions. Jet flavours are assigned according to the 
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predefined dijet flavour fractions and the flavour asymme- 
tries (Sect. 15.21 ). The flavour-dependent vertex reconstruc- 
tion efficiencies (Fig.|4| determine whether a secondary ver- 
tex is reconstructed in the generated jet. The vertex parame- 
ters are then taken from a fully simulated secondary vertex, 
picked at random from the vertex database bin with corre- 
sponding pj and \y\. 

Two independent sets of events are generated in a pseudo- 
experiment, one for the construction of templates and one 
to define a pseudo-data sample. These pseudo-data are ana- 
lyzed, using the relevant templates, to estimate the model pa- 
rameters. Repetition of the pseudo-experiments has demon- 
strated that the fit method is able to measure the model pa- 
rameters in Eq.dTTb within a wide range of initial values. The 
estimators obtained from the fits are unbiased and have pull 
distribution dispersions close to one. 

8 Results 

8.1 Data fit results 

An event-based extended maximum likelihood fit is used to 
fit the data. The fit is performed using the MlNUlT 071 
package included in the ROOT 11381 framework. A multi- 
nomial distribution is used in the likelihood function to de- 
scribe the numbers of dijet events with zero, one or two re- 
constructed vertices. Using the MlNUlT package, a detailed 
investigation of the likelihood function in the region around 
its maximum value has been performed, to estimate the sta- 
tistical uncertainties. It has been found that the parabolic ap- 
proximation of the analysis fitting function is valid around 
the maximum point. 

The quality of the description of the data obtained with 
the fit is illustrated in Fig. [8] where the data are compared 
with the Monte Carlo distributions predicted by the fit in the 
[160,250] GeV analysis bin. All features of the data distri- 
bution are correctly reproduced, with a relative accuracy of 
better than 10%. The residual differences are within the sys- 
tematic uncertainties of the measurements. 

Figure |9j a) presents the fitted vertex probability in light 
jets together with the prediction for dijet events generated 
with Pythia 6.423 and passing through the full detector 
simulation. The probability is averaged over leading and 
subleading jets in each pj bin. The vertices found in light 
jets are mainly fake ones (Sect. 6), therefore their probabil- 
ity is very sensitive to the details of the track and vertex 
reconstruction. Good agreement between data and Monte 
Carlo simulation demonstrates that the ATLAS detector per- 
formance is well understood in the Monte Carlo simulation. 

Figure [9jb) shows the admixture of jets containing two 
bottom hadrons, b2, given by the fit. It is very different from 
the Pythia 6.423 prediction, b^ — 0, and has a small statis- 
tical uncertainty. The significance of the measured admix- 



ture confirms the importance of this additional contribution 
of double-bottom jets for a correct description of the data. 
This observation agrees with the results of 1361 . The double- 
bottom jets are produced by the gluon splitting mechanism 
(Sect. 5). However, the analysis is unable to determine if a 
contribution from this mechanism to the fraction of jets with 
a single bottom hadron (see Fig. |l(b)| i is also enhanced in 
data. 

The fit results for the Z?-jet asymmetry Ab need to be 
corrected for detector effects, in order to represent truth- 
particle jets. The necessary correction is defined as a differ- 
ence between truth-particle jet and reconstructed jet asym- 
metries, averaged over all pj bins using PYTHIA 6.423, 
Herwig++ 2.4.2 and Pythia+EvtGen dijet events. The 
resulting correction of 0.04 units is added to the fit 
results. The corrected b-jet asymmetry is compared to 
the truth-particle b-jet asymmetries in Pythia 6.423, 
POWHEG+PYTHIA 6.423 and Herwig++ 2.4.2 in Fig. 13c). 
Pythia 6.423 predicts a much smaller b-jet asymmetry than 
observed in the data. Since semileptonic decays are well 
described in PYTHIA 6.423, the undetected energy due to 
neutrinos and muons from these decays cannot be the main 
contributor to the observed b-jet asymmetry. Modifications 
of the Pythia 6.423 generator, such as different proton 
structure functions or different bottom parton fragmentation 
functions, are unable to improve substantially the agreement 
between the data and Monte Carlo simulation. The b-jet 
asymmetry predicted by Herwig++ 2.4.2 grows faster with 
pj than for the data. The best description of the data is pro- 
vided by the Powheg+Pythia 6.423 generator, suggesting 
that NLO accuracy is needed to reproduce the b-jet asymme- 
try reliably. 

8.2 Unfolding 

To allow for a comparison with theoretical predictions and 
to remove detector resolution and acceptance effects, the 
flavour fractions for data must be unfolded to the truth- 
particle jet level as defined in Sect. [5] A simple bin-by-bin 
correction method is used. The expected inaccuracy intro- 
duced by the unfolding procedure itself is small in compari- 
son with the measurement uncertainties. The unfolding cor- 
rection factors for each flavour combination and leading jet 
pi bin are determined as ratios of the reconstructed dijet 
events with required jet flavours to the corresponding truth- 
particle dijet events (Sect. |5]l in a given bin. They are calcu- 
lated using the fully simulated PYTHIA 6.432 dijet sample 
and are typically in the 60%-100% range, mainly because 
of the pj cut on the reconstructed subleading jet. The correc- 
tions are different for dijet flavour fractions in the same pj 
bin due to semileptonic decays of heavy flavour hadrons and 
different jet energy distributions for light and heavy flavour 
subleading jets. 
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Fig. 8 Data description with the Monte Carlo templates obtained as a result of the fit in the [160, 250] GeV analysis bin. (a) and (b) show the f7 T 
andB T distributions for secondary vertices in the leading jet, (c) and (d) show the same distributions for secondary vertices in the subleading jet in 
events with a single secondary vertex in two jets, (e) and (f) show the 17 T and B T distributions in the events with two secondary vertices averaged 
over leading and subleading jets. Data statistical uncertainties only are used to calculate the errors of the data to the Monte Carlo prediction ratios. 



The truth-particle dijet flavour fractions in each analysis where /, is a flavour fraction obtained in the fit and e, is the 

bin are calculated using the following formula: corresponding unfolding correction factor. The y. unfold does 

.-unfold fi/ e ' ox not coincide with the f, because all correction factors e, in a 

fi =hJfM' (18) 
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Fig. 9 Data fit results for the (a) average fake vertex probability in light jets v u , (b) 2/>jet admixture bi and (c) bottom dijet asymmetry A/,. 
Statistical uncertainties only are shown. The fake vertex probability is shown with the PYTHIA 6.423 reconstructed jet predictions. The 2&-jet 
admixture should be zero if PYTHIA 6.423 were fully consistent with data. The fitted bottom dijet asymmetry is corrected to the truth-particle jet 
level and compared with PYTHIA 6.423, Herwig++ 2.4.2 and POWHEG+PYTHIA 6.423 truth-particle jet predictions. 



given analysis bin are different, as explained earlier. Usually 
e, is smaller than one; therefore the normalisation in Eq.(fT8l) 
is needed. The unfolded flavour fractions for truth-particle 
dijet events defined in Sect. [5] are presented in Table [2] as 
well as in Fig.[l0] for the different leading jet pj bins. 

8.3 Systematic uncertainties 

The measured dijet flavour fractions are subject to system- 
atic uncertainties, due to the assumptions made in selecting 
the model parameters in Eq.(fT7b and the following effects: 

- Reconstructed jets in data and Monte Carlo simulation 
may have different kinematic properties due to trigger 
requirements, jet energy scale (JES) uncertainties, clean- 
ing cuts in the data selection procedure and event pile- 
up. 

- Differences between data and Monte Carlo simulation 
in the template shapes are possible, despite the tuning of 
the template shape to the track resolution, and the ad- 
justment of the fit to increase the fraction of jets with 
two b-quarks. 

- The JES uncertainty and differences in energy between 
light and heavy flavour jets influence the unfolding cor- 
rection factors. The template shapes are also affected by 
the remaining pj dependence of the B T variable. 

- Imperfect description of bottom and charm hadron decay 
properties in Monte Carlo generators. 

The influence of the differences in the jet pj and rapid- 
ity distributions between data and Monte Carlo simulation 
on the analysis results is estimated by using PYTHIA 6.423 
templates obtained with and without the pj and rapidity 
reweighting, respectively. The differences in the results are 
taken as systematic uncertainties. Both make only minor 



contributions to the full systematic uncertainties. The influ- 
ence of pile-up is estimated by adding minimum bias events 
to the Pythia 6.423 dijet events and repeating the analysis 
procedure. The effect is found to be negligible. 

A potential bias due to the incorrect modelling of the 
JES is estimated by varying the jet energy response by 
its uncertainty [39|. Detailed studies have shown that the 
JES uncertainty is smallest in the central calorimeter re- 
gion (\r)\ < 0.8) for jets with pj > 60 GeV, with values 
of ~ 2.5%, and that it is well below the 5% level for the 
whole kinematic range of this analysis. Both jets in a jet pair 
are varied simultaneously. An additional Z?-jet energy uncer- 
tainty is taken into account, and also applied for charm jets. 
Templates obtained from PYTHIA 6.423 events with mod- 
ified jet energies are used for the data fit. Due to the de- 
pendence of the parameterisation of the charm and bottom 
vertex reconstruction efficiencies on jet pj, these values are 
modified following the jet energy scaling. The systematic 
uncertainty due to the JES is estimated to be half of the 
difference between the fit results with positive and negative 
variation of the jet energy. The JES uncertainty is one of the 
major systematic uncertainties for all flavour fractions. In 
particular, for fsu and feu it varies from absolute values of 
0.2% and 1.1 % in the lowest p T bin, to 0.1 % and 0.8% in 
the highest pj bin. 

The charm and bottom secondary vertex reconstruction 
efficiencies are fixed in the analysis to the predictions for 
Pythia 6.423 dijet events, as explained in Sect. [7] To esti- 
mate possible deviations of these efficiencies, several Monte 
Carlo generators are used. The influences of a different 
proton structure function set (Pythia+CTEQ 6.6), a dif- 
ferent parton fragmentation function (PYTHIA+Peterson), 
a different showering model (Herwig++), different charm 
and bottom hadron decays description (Pythia+EvtGen) 
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Lead, jet pj [GeV] 


40-60 


60-80 


80-120 


120-160 


160-250 


250-500 


fm [%] 


0.65±0.04±0.12 


0.63±0.04±0.11 


0.58±0.02±0.11 


0.61±0.03±0.10 


0.58±0.05±0.07 


0.39±0.08±0.06 


fBC [%] 


0.49±0.15±0.18 


0.31±0.13±0.18 


0.53±0.08±0.19 


0.52±0.09±0.22 


0.28±0.17±0.24 


0.93±0.36±0.24 


fee [%] 


1.08±0.30±0.31 


1.51±0.29±0.33 


1.03±0.11±0.28 


0.86±0.13±0.24 


1.68±0.30±0.44 


0.70±0.47±0.50 


fm [%] 


4.07±0.14±0.45 


4.78±0.14±0.46 


5.43±0.08±0.54 


6.02±0.09±0.52 


6.55±0.17±0.42 


6.69±0.29±0.52 


feu [%] 


10.6±0.5±1.7 


10.3±0.5±1.3 


11.3±0.25±1.5 


10.9±0.24±1.8 


11.0±0.5±2.0 


12.4±0.8±2.8 


fuu [%] 


83.1±0.6±2.0 


82.4±0.5±1.7 


81.2±0.3±1.8 


81.1±0.3±2.0 


80.0±0.6±2.4 


78.9±0.9±3.6 



Table 2 The unfolded dijet flavour compositions for each leading jet pj bin, with statistical uncertainties as first entries and the full systematic 
uncertainties as second entries. 



and additional track impact parameter smearing have been 
studied. Herwig++ shows the largest deviations in the sec- 
ondary vertex reconstruction efficiency for bottom from the 
PYTHIA 6.423 Monte Carlo. The difference is — 6 % in the 
lowest pj region, but decreases to ~ 2 % in the highest pj 
region. In the case of charm, Pythia+EvtGen predicts the 
largest absolute deviations of ~ 2% from PYTHIA 6.423. 
Since the largest uncertainty in the vertex reconstruction ef- 
ficiency comes from the fragmentation model (Herwig++) 
for bottom and from the charm hadron decay description 
(EvtGen) for charm, the deviations in the charm and bot- 
tom vertex efficiencies are treated as independent for the 
systematic study. The systematic uncertainties in the flavour 
fractions are estimated by varying the charm and bottom ver- 
tex reconstruction efficiencies in the data fit by their max- 
imal deviations. The uncertainty due to the bottom vertex 
efficiency is comparable with the JES uncertainty for the 
flavour fractions with bottom, and small otherwise. Simi- 
larly, the systematic uncertainty driven by the charm vertex 
efficiency is important for the fractions with charm. 

The influence of imperfections in the Monte Carlo tem- 
plate shapes is estimated in two ways. The baseline tem- 
plates are constructed from Monte Carlo jets passing the di- 
jet selection procedure. Alternatively, one can use jets with- 
out a dijet selection. The templates obtained in this way are 
biased, due to different kinematic properties of the jets and 
changes in the contributions of the different heavy flavour 
production mechanisms. The number of contributing jets is 
also significantly larger, which makes these templates virtu- 
ally independent from the baseline ones. To extract the sys- 
tematic uncertainty, the data fit is redone with the inclusive 
jet templates. The statistical fluctuations due to the indepen- 
dent templates are reduced by smoothing the differences in 
the fit results, using a linear function fit over the whole anal- 
ysis pj range with weights ^/Ni, where N, is the number of 
selected data events in bin i. The smoothed differences in 
the flavour fractions between data fits are taken as system- 
atic uncertainties. In absolute values, they vary from 0.08 % 
in the lowest pj bin to 0.2% in the highest pj bin for the 
fee fractions and from 0.06 % to 1.3% for the feu fractions. 
This systematic uncertainty is significantly smaller for the 
other flavour fractions. 



Another check of the influence of the template shape is 
made by generating templates using Herwig++ instead of 
PYTHIA 6.423. The dedicated simulation model described 
in Sect. [O] is exploited for this study. The PYTHIA 6.423 
fully simulated vertices are used for template creation, but 
pseudo-data are created with Herwig++ vertices. Then the 
standard analysis procedure is applied. The averaged val- 
ues based on 200 pseudo-experiments are compared with the 
initial fast simulation model parameters and the differences 
are considered as systematic uncertainties. Overall, the sys- 
tematic uncertainty due to the template shapes constitutes a 
large contribution to the full systematic uncertainty for all 
flavour fractions, and is similar in size to those from JES 
and secondary vertex reconstruction efficiencies. 

The predictions of the Monte Carlo simulation for the 
amount of heavy flavour in the leading and subleading jets 
differ significantly from one generator to another, as can be 
seen in Fig. [3] In the current analysis the charm production 
asymmetry is fixed to the PYTHIA 6.423 prediction. To de- 
termine the systematic effect of an imprecise description of 
the charm asymmetry, the data fit is redone with the charm 
asymmetry values given by Powheg+Pythia 6.423 as 
shown in Fig. [3] This systematic uncertainty reaches ~ 40 % 
of the total uncertainty for the fee fraction and ~ 20 % for 
the fsc fraction in the high pj region, in all other cases it is 
below ~ 10 %. The admixture of jets with two charm quarks 
inside is also fixed to the PYTHIA 6.423 prediction in the 
analysis. To determine the systematic effect due to this, the 
double-charm admixture is varied by a fixed value, equal 
to 1/3 of the measured double-bottom jet admixture. This 
choice is justified by a comparison of the bottom and charm 
asymmetries in Fig. [3] which are governed by similar QCD 
effects. This systematic uncertainty becomes important for 
the feu and fsu fractions for large pj. It is 1.2% for feu 
and 0.35 % for f BU in the [250, 500] GeV bin. 

To improve the agreement between data and Monte 
Carlo simulation, the flavour template shapes are tuned on 
the 2010 data as described in Sect. 16.41 The systematic un- 
certainties due to this procedure are estimated by repeating 
the full analysis using the fully smeared (F smeM — 1 .0, see 
Sect. 16.41 ) PYTHIA 6.423 dijet sample for template construc- 
tion and definition of the vertex reconstruction efficiencies. 
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This systematic uncertainty is ~ 50 % of the total systematic 
uncertainty for the fsu fraction in the high pj region and 
significantly smaller in other cases. 

The unfolding procedure for obtaining the dijet flavour 
fractions at the truth-particle level is based on estimations of 
the dijet reconstruction efficiencies from Monte Carlo simu- 
lation. Systematic uncertainties on these are estimated using 
the differences in the unfolded flavour fractions calculated 
with the unfolding coefficients predicted by Pythia 6.423 
and Herwig++ 2.4.2. The flavour dijet reconstruction effi- 
ciencies are calculated for each analysis pj bin and there- 
fore also depend on the JES modelling. The changes in the 
unfolded flavour fractions due to the shifted jet energies are 
considered as the JES-induced unfolding systematic uncer- 
tainties. In both cases, the differences in the unfolded flavour 
fractions have significant statistical fluctuations due to the 
fact that the number of Monte Carlo events used for the re- 
construction efficiency estimation is limited. The differences 
for each flavour fraction are therefore smoothed in the same 
way as the template shape systematic uncertainty. In the low 
pj bins the systematic uncertainties due to the unfolding are 
comparable in size to the uncertainties from JES and tem- 
plate shapes for fee, /bu an d feu - In a U other cases they are 
relatively small. 

The full systematic uncertainties on the unfolded dijet 
flavour fractions are presented in Table [2] These uncertain- 
ties are added in quadrature to the statistical uncertainties 
and are shown as shaded bands in Fig. |T0j Except for BU, 
all data fractions are in agreement within the uncertainties 
with the predictions of the LO and NLO generators. The BU 
fraction, while coinciding reasonably well with the Monte 
Carlo simulation predictions at low jet pj, shows disagree- 
ment for jets with pj above ~ 100 GeV. 



9 Conclusions 

An analysis of the flavour composition of dijet events 
has been performed, based on an integrated luminosity of 
39 pb" 1 collected by the ATLAS detector in 2010 at a 
centre-of-mass energy of 7 TeV. The analysis makes use of 
reconstructed secondary vertices in jets, without explicitly 
assigning individual flavours. Instead, kinematic properties 
of the ensemble of tracks associated with a secondary ver- 
tex are used to distinguish between light, charm and bot- 
tom jets. Specially constructed and optimised variables that 
are highly sensitive to the flavour content of jets, have been 
employed. The dijet heavy flavour fractions are determined 
from a multidimensional fit using templates of these vari- 
ables. 

The analysis demonstrates the capability of ATLAS to 
measure the dijet fractions containing bottom jets and the 
more challenging charm jets down to the level of ~ 0.5 %. 



All five dijet final states with heavy flavours are reliably ex- 
tracted and measured as a function of the leading jet pj. 

A significant difference in the bottom hadron con- 
tent between leading and subleading jets is observed. 
This difference is poorly described by the LO generators 
Pythia 6.423 and Herwig++ 2.4.2, whereas the NLO gen- 
erator POWHEG reproduces the data well. 

The data-driven b-jet shape approach used in the fit 
demonstrates a deficiency of the b-jet template obtained with 
Pythia 6.423, particularly in the high jet pj region. An 
increase of the template contribution describing the pres- 
ence of two b-hadrons inside a jet substantially improves 
the agreement between data and Monte Carlo simulation. 

The measurements of the six dijet flavour fractions are 
compared with the predictions of the two LO generators 
Pythia 6.423 and Herwig++ 2.4.2, and also with the NLO 
generator POWHEG. All generator predictions are consistent 
with each other and agree with the measured values, ex- 
cept for the mixed BU dijet fraction, which is systematically 
above all the predictions in the high pj region. 
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Fig. 10 The unfolded dijet flavour fractions for each leading jet pj bin (black points) with PYTHIA 6.423 (squares), Herwig++ 2.4.2 (circles) and 
POWHEG+PYTHIA 6.423 (filled triangles) predictions overlaid. The error bars on the data points show statistical uncertainties only, whereas the 
full uncertainties appear as shaded bands. 
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